1/14 
T. SHIBUYA 
JP920030020US1 (CPA) 



FIG. 1 



(a)

TARGET NUCLEOTIDE SEQUENCE T: GTAGTCAGGCCAT

COMPLEMENTARY SEQUENCE Q: GTCAGGCCAT

COMMON SUBSEQUENCE?

HYBRIDIZATION

PROBE P: ATGGCCTGAC
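
The relation between probe P and complementary sequence Q in panel (a) can be checked with a minimal sketch. It assumes that Q is the reverse complement of P under standard A-T / G-C pairing, which is consistent with the sequences printed in the figure; the helper name `reverse_complement` is hypothetical and does not appear in the drawings.

```python
# Hypothetical helper (not named in the drawings). Assumes standard
# Watson-Crick pairing and that Q is P complemented and read in reverse.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the complement of seq, read in the opposite direction."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


probe_p = "ATGGCCTGAC"        # PROBE P from FIG. 1 (a)
target_t = "GTAGTCAGGCCAT"    # TARGET NUCLEOTIDE SEQUENCE T from FIG. 1 (a)

complementary_q = reverse_complement(probe_p)
print(complementary_q)              # GTCAGGCCAT
print(complementary_q in target_t)  # True
```

Under this assumption, Q occurs verbatim inside T, which is the exact-match case of the "common subsequence" question posed in the figure.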



(b)

SUBSEQUENCE S:            ATGCGCG-TAAGT
COMPLEMENTARY SEQUENCE Q: ATG--CGTTAACT

TWICE INSERTION

ONCE INSERTION

ONCE SUBSTITUTION

RESEMBLANCE r = 0.6364
MAXIMUM EDIT DISTANCE k = 4
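
The numbers in panel (b) can be reproduced with a short sketch. It rests on two assumptions that the drawing does not state: the two rows are aligned column by column with `-` marking a gap, and the resemblance is r = (m - k) / m, where m is the length of Q without gaps. The function name `count_edits` is hypothetical.

```python
def count_edits(row_s, row_q, gap="-"):
    """Count insertions and substitutions in a column-aligned pair of rows.

    A column with a gap on exactly one side counts as one insertion;
    a column with two different bases counts as one substitution.
    """
    if len(row_s) != len(row_q):
        raise ValueError("aligned rows must have equal length")
    insertions = 0
    substitutions = 0
    for a, b in zip(row_s, row_q):
        if (a == gap) != (b == gap):
            insertions += 1
        elif a != b:
            substitutions += 1
    return insertions, substitutions


# Aligned rows as read from FIG. 1 (b); the gap placement is an assumption.
row_s = "ATGCGCG-TAAGT"
row_q = "ATG--CGTTAACT"

insertions, substitutions = count_edits(row_s, row_q)
k = insertions + substitutions          # edit operations in this alignment
m = len(row_q.replace("-", ""))         # length of Q without gaps
r = (m - k) / m                         # assumed definition of resemblance
print(insertions, substitutions, k, m, round(r, 4))  # 3 1 4 11 0.6364
```

The count covers only the alignment drawn in the figure; it is not a search for the minimum edit distance between S and Q.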



FIG. 3 



RECEIVE TARGET NUCLEOTIDE SEQUENCE Ti 
AND NUCLEOTIDE SEQUENCE OF PROBE P 



DETERMINE COMPLEMENTARY SEQUENCE 
AND RECEIVE EDIT DISTANCE k 



EVALUATE BINDING POSSIBILITY 
(largest_prefix_length_with_common_subsequence)



STORE EVALUATION RESULT AND 
DETERMINE BINDING POSSIBILITY 
FIG. 4



RECEIVE TARGET NUCLEOTIDE 
SEQUENCES T1 TO Tn AND 
NUCLEOTIDE SEQUENCE OF PROBE P
FIG. 5

function: largest_prefix_with_common_subsequence(T,Q,k)

INITIALIZATION STEP

S40

EVALUATION STEP

S52

S46

min_k > k ?

YES

NO

probe_position=
THERE IS POSSIBILITY OF PRESENCE
OF COMMON SUBSEQUENCE HAVING
CHAIN LENGTH OF m-k OR LONGER,
SO OUTPUT positions[k]
THERE ARE NO COMMON
SUBSEQUENCES OF LENGTH
m-k OR LONGER, SO OUTPUT n+1
FIG. 9



// function largest_prefix_with_common_subsequence

largest_prefix_with_common_subsequence(sequence T[1..n], sequence Q[1..m], k) {
    for (i=0; i<=k; i++) positions[i] = 0;
    min_k = 0;

    for (probe_position=1; probe_position<=m; probe_position++) {
        max_k = min{k, probe_position};
        for (i=max_k; i>=min_k; i--) {
            if (i==0) {
                positions[i] = next_char_position(T, positions[i]+1,
                                                  Q[probe_position]);
            } else {
                positions[i] = min{ next_char_position(T, positions[i]+1,
                                                       Q[probe_position]),
                                    positions[i-1] };
            }
            if (positions[i] > n) {
                min_k = i+1;
            }
        }
        if (min_k > k) {
            // (1) Absence of common subsequence with length of m-k or longer
            return n+1;
        }
    }
    // (2) Presence of common subsequence with length of m-min_k or longer
    return positions[k];
}
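
The pseudocode of FIG. 9 can be exercised with a minimal Python sketch. Several points are assumptions rather than statements from the drawings: `next_char_position(T, start, c)` is taken to return the smallest 1-based index at or after `start` where `T` holds `c`, and `n+1` when there is none; the "absence" result is `n+1`, following FIG. 5; and the early `break` stands in for the C-style loop condition `i >= min_k`, which is re-evaluated after `min_k` changes.

```python
def next_char_position(T, start, c):
    """Smallest 1-based index j >= start with T[j] == c, else len(T) + 1.

    Assumed behaviour; the drawings name this helper but do not define it.
    """
    n = len(T)
    if start > n:
        return n + 1
    j = T.find(c, start - 1)      # str.find is 0-based and returns -1 if absent
    return j + 1 if j != -1 else n + 1


def largest_prefix_with_common_subsequence(T, Q, k):
    """Return positions[k] as in FIG. 9, or len(T) + 1 if no match is possible.

    positions[i] is the shortest prefix length of T that contains the
    processed part of Q as a subsequence after skipping i characters of Q.
    """
    n, m = len(T), len(Q)
    positions = [0] * (k + 1)
    min_k = 0
    for probe_position in range(1, m + 1):
        c = Q[probe_position - 1]
        max_k = min(k, probe_position)
        for i in range(max_k, min_k - 1, -1):
            matched = next_char_position(T, positions[i] + 1, c)
            if i == 0:
                positions[i] = matched
            else:
                # positions[i-1] still holds the value from the previous
                # probe_position, i.e. the option of skipping character c.
                positions[i] = min(matched, positions[i - 1])
            if positions[i] > n:
                min_k = i + 1
                break             # smaller i can no longer succeed
        if min_k > k:
            return n + 1          # (1) absence
    return positions[k]           # (2) presence


print(largest_prefix_with_common_subsequence("GTAGTCAGGCCAT", "GTCAGGCCAT", 0))  # 13
print(largest_prefix_with_common_subsequence("ACGT", "AGGT", 1))                 # 4
print(largest_prefix_with_common_subsequence("AAAA", "CCCC", 1))                 # 5
```

The first call uses T and Q from FIG. 1 (a) with k = 0; the other two inputs are made up for illustration. A result no larger than the length of T means a common subsequence of the required length may be present, while a result of length + 1 means it cannot be.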
FIG. 10



// function check_target

check_target(sequence T[1..n], sequence Q[1..m], k, overlap_length, check_length) {
    position = 1;

    while (position <= n) {
        length = largest_prefix_length_with_common_subsequence(
                     T[position..n], Q[1..m], k);
        if (length < check_length) {
            if (check_exactly(
                    T[position..(min(n, position+check_length))], Q, k)) {
                position = position + check_length;
            } else {
                return ("T binds to P");
            }
        } else if (length - 1 <= n - position + 1) {
            position = position + length - overlap_length - 1;
        } else {
            position = n+1;
        }
    }
    return ("T does not bind to P");
}
FIG. 12

(a)

EDIT DISTANCE k=3; positions[0]

m
COMPLEMENTARY SEQUENCE
TARGET NUCLEOTIDE SEQUENCE: C GCGCATGAA
i: 1 2 3 4 5 6 7 8 9 10

[positions[0],m] = (positions[0], 2, 3, 5, - -, -, -)

(b)

EDIT DISTANCE k=3; positions[1]

m: 1 2 3 4 5 6 7 8
COMPLEMENTARY SEQUENCE
TARGET NUCLEOTIDE SEQUENCE: C GCGCATGAA
i: 1 2 3 4 5 6 7 8 9 10

[positions[1],m] = (positions[1], 1, 3, 5, 6, 7, 8)
FIG. 16